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Abstract 

We consider a nonlinear ordinary differential equation and want to 
control its behavior so that it reaches a target by minimizing a cost func- 
tion. Our approach is to use hybrid systems to solve this problem: the 
complex dynamic is replaced by piecewise affine approximations which al- 
low an analytical resolution. The sequence of affine models then forms a 
sequence of states of a hybrid automaton. Given a sequence of states, we 
introduce an hybrid approximation of the nonlinear controllable domain 
and propose a new algorithm computing a controllable, piecewise convex 
approximation. The same way the nonlinear optimal control problem is 
replaced by an hybrid piecewise affine one. Stating a hybrid maximum 
principle suitable to our hybrid model, we deduce the global structure of 
the hybrid optimal control steering the system to the target. 



1 Introduction 

Aerospace engineering, automatics and other industries provide a lot of opti- 
mization problems, which can be described by optimal control formulations: 
change of satellites orbits, flight planning, motion coordination (see e.g. [23] 
or [36] for more applications in aerospace industry). In general those optimal 
control problems are fully nonlinear ; since the years 1950-1970, the theory of 
optimal control has been extensively developed and has provided us with power- 
ful results like dynamic programming [9] or the maximum principle [37] . A large 
amount of theory has been developed and resolutions arc mainly numerical. 

We here consider a dynamical system whose state is described by the solution 
of the following ordinary differential equation (ODE): 

X{t)^f{X{t),u{t)) , 
X(0) = Xo ^ ' 
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We present a hybrid (discrete-continuous) algorithm controlhng the system (1) 
from an initial state Xq at time t = to a final state X/ = at an unspecified 
time tf. To reach this state, we allow the admissible control functions u to take 
values in a convex and compact polyhedral set V„i of R™, in order to minimize 
the following cost-function: 



There exists two main approaches for the resolution of nonlinear optimal 
control problems (see e.g. [44] for a review of different solving methods). 

A first class of solving methods is based on the Bellman dynamic principle 
and the characterization of the value function J(X, u) in terms of viscosity solu- 
tions of a nonlinear Hamilton- Jacobi-BcUman (HJB) equation [19, 8, 7, 15, 10]. 
The related numerical methods, called direct methods, often uses time and/or 
space discretizations and are really efficient in small dimension. Unfortunately, 
their cost in memory is in general too expensive in high dimension. 

The second class of methods is based on the Pontryagin Maximum Principle 
[37, 25]. This principle provides a pseudo-Hamiltonian formulation of optimal 
control problems and some necessary optimality conditions. In this context, the 
foremost numerical methods are the indirect shooting methods. These methods 
are very efficient in any dimension but require an a priori knowledge of the 
optimal trajectory structure. 

Today's challenge is to reach a compromise between direct and indirect meth- 
ods in order to enable the handling of high dimensional problems with a satis- 
factory precision and without any pre-simulation to locate optimal trajectories. 

In this paper, we propose a different approach: the use of hybrid computing 
[20]. The idea is to approach nonlinear systems like (1) by simplified models 
which we can analytically study. Basically, an analytical approach must allow 
to improve approximations as is done by [27, 21]: the level of details allows to 
reach a compromise between quantitative quality of the approximation and the 
computational time. The latter has been done e.g. for biological systems, where 
simplifications (in relation to real data and in regard of model simulations) are 
possible, see [21]. We here choose to use a hybrid system modeling i.e. to 
approximate the original system (1) by a continuous and piecewisc affine one. 
Contrary to [29] , we want to focus on both the discrete and continuous aspects 
of the hybrid systems. We thus first compute on the fly a polyhedral mesh of 
phase space and the control space and approach the global system by piecewise 
affine dynamics in each mode: in each cell of this mesh, the system is piecewise 
affine {X{t) = AX{t) + Bu{t) + c) and can be locally solved, mostly with 
analytical tools. We then find inside the cells the affine feedback controls a la 
[28] which enable a global controllability. Finally, to optimize the choice of the 
solutions, we derive from [38, Theorem 2.3] a hybrid maximum principle where 
the transitions are not constrained. 

The paper is organized as follows: §2 presents algorithms for the hybrid 
approximation of nonlinear system. Next, in section 3, the controllability of the 




(2) 
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system is studied and a new efficient algorithm computing a piecewise convex 
approximation of the controllable set is developed. The last section §4 is devoted 
to a heuristic approaching the resolution of optimal control problems: we first 
validate our hybrid approach for this resolution and establish a suitable hybrid 
maximum principle. We then show that the analysis of the optimality conditions 
gives us the general structure of optimal trajectories. 

2 Hybrid Approximation of Nonlinear Systems 

In this section, we model nonlinear control systems by way of piecewise affine 
systems. Our main objective here is to replace the nonlinear system (1) by 
another one, complex enough to reproduce the intrinsic properties of the initial 
system, but also simple enough to be analytically studied. 

Let us consider the general nonlinear control system X{t) = f {X (t) , u(t)) , 
where the admissible controls are measurable. Moreover we assume that the 
bounded functions u take values in a compact polytope Um of M™. Classical 
numerical methods like Euler or Runge-Kutta usually propose time discretiza- 
tions of nonlinear systems and then obtain discretizations of solution trajecto- 
ries. On the contrary, we here use hybrid computation introduced in [20] and 
whose main idea is to compute an approximation of the nonlinear field / by 
means of a state discretization. Extended to control systems, the principle is as 
follows: for a given mesh of the control and state space R" x Um, we compute 
a piecewise affine approximation of the nonlinear vector field /. This partition, 
associated to the so computed piecewise affine dynamic, defines a hybrid system 
which we can thus analyze. We thus first describe the construction of a piece- 
wise affine approximation of the nonlinear system. We then propose a complete 
study of the interpolation error and the convergence of the approximation. Then 
we develop algorithms building an implicit mesh of the state and control space 
R" X Um- This will enable us to define our hybrid model for nonlinear control 
systems. 

2.1 Piecewise afRne approximation of nonlinear systems 

In this paragraph we want to approximate the nonlinear system (1) by a system: 

Xit)^ UXit),u{t)) (3) 
where fh is a piecewise affine approximation of the nonlinear vector field /. 

2.1.1 Computation of the piecewise affine approximation 

There are many possible linearizations by part ; especially in the context of 
nonlinear systems without terms of control, [16] and [27] propose a linearization 
technique based on the interpolation of the nonlinear vector field at the vertices 
of a given simplicial mesh of the state space. This technique is extended in this 
paragraph to control dynamics like (1) and therefore enables the computation 
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of a piecewise affine approximation with respect to the state X and the control 
u. 

Let A — (Ai)ig/ be a given simphcial mesh of M" x Um. Any afhne function 
in dimension n + m is uniquely defined by its values at (n + m + 1) affine inde- 
pendent points. Thus, as done e.g. in [26], in each cell A^, we can compute an 
affine approximation fh of / by interpolation at the vertices of the cell. More- 
over, only one supplementary function evaluation can be sufficient to compute 
an approximation when the system is evolving to an adjacent cell. 

Then, we consider a simplicial cell A^ of A, defined as the convex hull of 
its (71 + m + f ) vertices: ci, . . . , an+m+i- Let /,; be the afhne approximation of 
/ computed by interpolation at the vertices of A^. We have that: fi{X,u) = 
AiX + BiU + Ci. The approximation fi is computed by interpolation of / at 
the vertices of the cell A^, so that: Vj, /(cj) = fi{<Jj)- Hence, this proves that 
/((Tj) = [Ai \ Bi ]uj+Ci which in turn induces that Vj = f , . . . , n+m+l, f{(Jj) — 
/((Ti) = \Ai\Bi ]((Tj— (Ti). We thus define Mi to be the (n+m)x (n+m) matrix, 
whose columns are the vectors of vertices: {oj — cti ; j = 2, . . . , n + m + 1} and 
Fi is the {n-\-m) x n matrix, whose columns are the vectors {/((Xj) — /(fi); J = 
2, . . . , n -t- rn + 1} so that Fi ^ [ Ai \ B^ ]Mi. 

Consequently, by linear independence of the vertices of the simplex A^, the 
square matrix Mj is non singular, so that we obtain the piecewise affine approx- 
imation of figure 1 by^: 

Control space 













f, 




State space 















[A,\B,] = F, Mr^ 

c, = /(ai) -[A,\ B, ]a, (4) 
h{X, u) = A,X + B,u + c„ if (X, u) € A, 

Figure f : Definition of the piecewise affine approximation 

2.1.2 Interpolation error and convergence of the Hybridization scheme 

For autonomous nonlinear systems: x{t) = g{x{t)), [27, §f l.f] presents a study 
of the approximation error inside a compact set D under some regularity as- 

^the Ci can be solved as well with any column as Vj = 1, . . . , n+m+1, f{o'j) — [ Ai \ Bi ]aj = 

Ci. 
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sumptions on the field g. In the context of nonhnear control systems X{t) = 
f {X (t) , u{t)) , the control is in general only measurable so that the required reg- 
ularity for / is lost. Regularity assumptions on the initial nonlinear field / are 
usually chosen to fulfill the conditions of the Cauchy-Lipschitz theorem applied 
to control systems, see e.g. [41, 44, 40]. From now on we therefore assume that 
/ is continuous, Lipschitz continuous over any compact set in X, uniformly in 
u. In other words, we have the condition (Ai): 

(Ai) m (E K", 3Ln > 0, V(Xi, X2) G n"^, Vu e U,„, 
\\f{X2,u) ^ f{Xi,u)\\ < Ln\\X2 - Xi\\. 

Then for any given control u, we can prove the existence and uniqueness of so- 
lution trajectories of the nonlinear system (1): we first establish the regularity 
properties of the piecewise affine approximation fh, required by the Cauchy- 
Lipschitz theorem. This then enables us to get an accurate bound on the error 
produced by our linearization scheme. We end this study by convergence results 
on our scheme. 



Let us introduce the size h of the simplicial mesh A = (Ai)ig/ defined by: 
h = supjg/ hi where hi = sup^ ||a; — y\\ and ||.|| denotes the 00-norm on 

We first need the continuity of the vector field fh which is an application 
of [27, proposition 11.1.3] to a state-control space: 

Proposition 1 fh is continuous over R" x Um and locally Lipschitz in X, i.e. 
for all compact subset D, in R", the restriction of fh to 57 x Um is Lipschitz 
continuous in X , uniformly in u. 

Then, from this continuities of / and //i, wc deduce the following bound on 
the magnitude of fh'. 

Lemma 1 / and fh are locally hounded in X , uniformly in u on R" x U„i, i.e.: 

yR > 0,3Cr > 0,y{X,u) e B(0,i?) x U™, \\fhiX,u)\\ < Cr 

Moreover, if f is bounded on R" x Um, then fh is also bounded on R" x Um 
(with the same bound). 

Consequently for any given control w, the conditions of the Cauchy-Lipschitz 
theorem are satisfied and wc therefore obtain the existence and uniqueness of 
solutions to the Cauchy problem: X{t) = fh{X{t),u{t)), X{0) = Xq. Now 
under some non restrictive assumptions on the nonlinear vector field /, our 
interpolation error can be evaluated with respect to the size h of the mesh A of 
R« X U„,: 

Proposition 2 (Interpolation error) i. If f is L-Lipschitz on R" x Um, 
then: 

sup \\f{X, u) - fh{X, u)\\ < ^^1" + ""^ h = e{h) 
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a. If f is continuously differentiable on R" X U,„, then for any compact set 
fl C M" , then there exists Lq > such that: 

sup \\f{X,u)-f^{X,u)\\<^^^^p^h^en{h) 



where: = max sup ||-Df!| • 

9'e//A,,n(OxO™)5^0 \(x.«)eA,, / 

The principle of the proof of proposition 2 is first to evaluate the interpolation 
error inside one cell Ag' of the mesh A. If the so computed error docs not 
depend on the given cell Ag/, then we deduce the global interpolation error ; 
otherwise the error is evaluated on any compact set Q x Um of the state and 
control space. See [40, proposition 2.3.1] for more details. 

Now, let Xq e M" be a given initial point and let the state X{.) (resp. 
Xh{.)) be the corresponding solution of the initial nonlinear (resp. piecewise 
afhne) system: X = f{X,u) (resp. Xh = fh{Xh,u)). Now by introducing 
the attainable sets A{XQ,t) and Ah{XQ,t) i.e. the sets of points that can be 
reached from Xq by the related system in time s < t, we obtain the following 
convergence results [40, proposition 2.3.2] of our piecewise affine approximation: 

Proposition 3 (C^ Convergence) 

i. If f is Lipschitz continuous in X, uniformly in u, then for all t > such that 
X{t) and Xh{t) are well defined: 

\\Xit)-X,m < ^(e^*-l) 
\\X{t)-X,,m < e{h)e'''. 

ii. Let f be Lipschitz continuous over any compact set 17 in X , uniformly in 
u (hypothesis Al) and assume that the attainable sets A{Xq^s)^ s > are 
compact. Then for t > 0, 

\\X{t)-X,(t)\\ < f^(e^"*-l) 

||i:(t)-i:^(t)|| < £0(^6""^*, 

where Q is defined by: il = A{Xo, t) U Ah{XQ, t). 

We can thus see that the key point is the definition of the mesh A and its size 
h. We now show that meshing the whole space is not mandatory. We rather 
compute the mesh on the fly. 

2.2 Implicit simplicial mesh 

Previously we have seen how to build a piecewise afhne approximation of the 
nonlinear dynamic / for a given simplicial mesh of the state and control domain. 
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In order to perform our hybrid approximation, we need now a method to build 
a simphcial mesh of R" x Um- As R" x Um is not bounded, it is algorithmically 
inconceivable to mesh the whole space R" x lLJ„i. Our approach is then to im- 
plicitly define a mesh of our space so that the simplicial subdivision is made on 
the fly 

Let /i > be the discretization step. There exists several ways to mesh a 
given domain. To each mesh corresponds only one piecewise affine approx- 
imation (3) by interpolation. Among all the possible meshes of the space 
R" X U„i, we choose to select those for which (0,0) is a fixed point of the 
system (3). Indeed, if the target is X/ = 0, we want to stay there once we have 
reached it. Therefore, u{tf) also has to be zero. A simple way to thus enforce 
//i(0, 0) = (0, 0) is to request the following property: 

Property 1 // G A.; then is a vertex of Ai . 

Lemma 2 Let us assume that /(O, 0) = 0. Let A be a mesh o/R" x Um that 
satisfies the property 1. Then (0,0) is also a fixed point of the piecewise affine 
system (3) built over A. 

Proof. At the vertices of the mesh, f{a) = fh{o') by the interpolating con- 
straints and (0, 0) is a fixed point of the initial system by definition. Therefore, 
/„(0,0) = /(0,0) = 0. □ 

Moreover we assume that the mesh A also satisfies the property: 
Property 2 Let D = {Dq)q^Q be the projection of A over the state space R". 

1. D is a simplicial mesh ofW"". 

2. Let {i,j) e I^. We state: A = PR"(Ai) and D.j = p^„{Aj). Then we 
have: Di = Dj or DiDDj =0. 

The property 2 is a geometrical constraint on the position of the cells A^ in 
R" X Um. Indeed, we now assume that, to each cell Di corresponds a "column" 
of cells Aj whose projection over R" is exactly Di (see for example, figure 2a). 
We note K.{q) = {i G I ; p^„{Ai) = Dq} the set of all the indices of cells A^ 
belonging to the same column q. Then each domain Dq x Um can be decomposed 
into exactly card IC{q) simplices in this way: 

Lemma 3 Dq x U„i = IJ Aq> . 

q'eK(q) 

The latter property will allow us in section 4 to have the same output constraints 
from a state cell Dq for each A,; in the column defined by K,{q). 

We are now able to define an algorithm computing an implicit mesh of the 
space R" x U,„. To satisfy property 2, we start by computing an implicit mesh 
D = (Dq)q£Q of the state space. Next, we compute an explicit triangulation 
of the control domain Um (small and bounded). We lastly deduce an implicit 
simplicial mesh {Ai)i^j of R" x U™. 



7 



(a) 




(b) 

Figure 2: Examples of meshes, for n = m = 1. with (a) satisfying the properties 
1 and 2 and where (b) satisfies property 1 and not property 2. 

2.2.1 Simplicial mesh of M" 

The state space R" is imphcitly cut into n-dimensional cubes. Each cube is then 
meshed into n\ simphces and the resulting partition is a mesh of R" (see [24], [27, 
chapter 11] for more details). Here, a n-cube C is defined by one point a = 
(ai)i=i...n & R" and the length^ h of its edges: C = [oi, ai + /i] x • • • x [a„, a„ + /i]. 
This will also be denoted by C = a + [0, /i]". In the same way we define a mesh 
of R" into ri-cubes: 

Definition 1 (Mesh of R" into n-cubes) Let a ~ {ai)i=i...n be a given point 
in R" and h > 0. Then (a + kh + [0, hy^)h=(ki,...,k„)£Z" is a mesh o/ R" into 
n-cuhes. 

Now, let us consider a n-cube C7 = [ai, ai + ft-] x • • • x [a„,a„ + h]. We then 
introduce iS„ the set of permutations of {1, . . . ,n}. For all Lp £ S„, Da^ip = 
{(xi, . . . , x„) e R"; < - 0^(1) < ■ • • < ^ ^^(n) < ft} is a simplex in 

R", whose vertices arc defined by: 

I Vi = l,...,p, 2:^(j) = a<p(i) p^O n (5) 

\ Vi = p+ 1, . . . ,n, = ay(i) + ft ' 

Note that property 1 is satisfied if a = 0. Now each n-cube can be meshed into 
n! simplices [24]: 

Proposition 4 (-D6.(^)(^gs„ is a mesh of size y/nh of the n-cube C ~ b+ [0, ft]". 

We then check that the respective meshes of two adjacent 72-cubes coincide at 
their intersection so that we deduce a global mesh of R" as in [40, proposition 
3.1.2]: 

^The Euclidean norm is used to compute lengths or sizes in the mesh. 
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Proposition 5 (-Dfc,ip)fceZ",<pe5„ a simplicial mesh of size y/nh of the state 
space R". 

2.2.2 Triangulation of U„ 

The control domain Um is a bounded convex polyhedron, defined as the convex 
hull of its vertices. The study of simplicial subdivisions of such polytopes has 
been extensively developed in recent years and provides us with some efficient 
tools to compute them (e.g. Delaunay triangulation of the Qhull [6] software'^). 
To build a regular mesh of size h of Um, we use an algorithm combining a 
Sierpinski-like discretization with the Delaunay triangulation. 

Remark 1 The triangulation of the control domain is computed once and for 
all at the beginning of the algorithm: indeed, in every control problem, there 
is in general no initial condition on the control. Therefore in each state cell, 
we need to compute all the possible related control cells and the triangulation 
can not be computed on the fly as for the state. This is possible for the control 
polytope as it is bounded and usually much smaller than the state space. Also 
this same explicit triangulation will be reused in every column cell of our hybrid 
automaton. 

2.2.3 Simplicial mesh of R" x U™ 

Let {Dq)q^Q and (Uj)j<zi be the respective simplicial subdivisions of R" and Um. 
The triangulation (A^),; of the state and control space R" x U,„ is built without 
new vertices via the Delaunay triangulation. This last criterion guarantees the 
property 2 to be satisfied and gives the following size of the resulting mesh: 

Lemma 4 Let Dq and Uj be two polytopes respectively in R" and R"'. Then 
we have: 



Combining this lemma with proposition 5 gives that our mesh A is of size 



In this section, we have described the main steps of the construction of an 
implicit simplicial mesh. Next, we will see that this mesh and the correspond- 
ing linear approximations, actually define a hybrid approximation of the initial 
system (1). 

2.3 Hybrid Automaton 

2.3.1 Definition of the hybrid model 

The state of a hybrid system is described by two variables: the first one, de- 
noted by q{t), is discrete and takes values in a countable set Q. The second one, 

•^tool for computation of convex hulls, Delaunay triangulation, Voronoi diagrams in 2d, 3d 
or higher dimension. 





y/n + Ih. 
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denoted by X{t), is continuous and takes values in the space K". In each dis- 
crete mode q G Q, the variable X evolves continuously according to a dynamic 
fq specific to the mode q. Therefore the hybrid dynamic and consequently the 
solving methods, depend on the choice of the discrete set Q. Since the control 
functions are generally measurable and not continuous (at best piecewise con- 
tinuous), trajectories {X{.), u{.)) in the space R" x Um are a priori discontinuous 
whereas the trajectories X(.) are continuous in the state space. Furthermore, 



u, 



u 








^ / 













\ — 




mill 




D2 


^0 




X 



Figure 3: Example of discontinuous trajectory in the space K x [0, Umax\ 

as illustrated on figure 3, the notion of transitions between two cells of the mesh 
A has no meanings with respect to the control. Indeed, an admissible control 
function can switch at any time to a given cell to another Aj, not neces- 
sarily adjacent to A^, and without reaching one boundary of the system. In 
consequence the choice of the continuous variable, describing the state of the 
hybrid system, is restricted to the state variable X G M". We then introduce 
the following notations: 

1. Q the countable set of indices of a simplicial mesh {Pci)qeQ of the state 
space M". 

2. £ = q') e Q X Q ; dDq n dDqi 7^ 0} the transition set: a transition 
between two discrete modes q and is allowed if the associated cells Dq 
and Dq' are adjacent. 

3.1? = {Dq ; (7 e Q} the collection of simplicial cells of the mesh of M". By 
construction, each domain Dq induces a set of afiine constraints on the 
state X in the mode q: = {X G R" ; NqX + Lq<0}. 

A. U ^ {Urn ; q E 0} C Vm the collection of control domains. Here, in each 
mode q the control domain is unchanged. 

5. J- = {fq ; q E Q} the collection of piecewise afSne field vectors defined by 
interpolation of the nonlinear field vector / (sec section 2.1): 

Vg eQ, fq-. Dqx Vra ^ 

{X,u) ^ MX,u) 
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According to lemma 3, the vector fq is locally defined by: Vg' G /C(g), V(X, u) S 

Aq, Jq{X,u) = Aq,X + Bq,U + Cq^ . 

6. G = {Ge ; e G 5} the collection of guards: Ve = {q,q') G £,Ge = 
dDqndDq,. 

Remark 2 The definition of hybrid systems proposed at first by M.S. Branicky 
and al. in [13], suggests the introduction of the collection TZ ~ {Rg ; e G 
of Reset functions here defined by: Ve — {q,q') € SyX e Ge^ReiX) = {X}. 
Indeed in our model, at any transition e ~ {q, q') e £, the corresponding cells 
Dq and Dq> are adjacent so that we do not need to reinitialize the continuous 
variable X. 

We can now define our hybrid model and the associated notion of solution, 
extending that of [27, §2.1]: 

Propositions Ti ^ {Q,£,'D,IA,!F,Q) is a hybrid system approximating the 
nonlinear system (1). 

Definition 2 A solution (or execution) of the hybrid system Ti. is a hybrid 
trajectory x = [t, q, X) where: 

• T = ([ti, ti+i])i=o...r (ii < ti+i if i < t) is a sequence of real intervals. 

• q : T Q describes the discrete state ofTi. and X : t ^ R" the continuous 
behavior of Ti.. 

and satisfying, for all i G {0, . • • , r} such that ti < ti+i, 

• For t €\ti,ti^i[, q[t) is constant and X[t) remains inside Dqi^^y 

• there is a measurable function u :]ti, t^+i [— > U„i such that: Vt ti+i [, X{t) = 

fq^t){X{t),u{t)). 

At any time t, the state of the hybrid system Ti. is described by its position 
X{t) in the state space M" and the associated discrete mode q{t) = q G Q. For 
each nonempty time interval ]ti, ti^i[, the system Ti. is in mode q{t) = qi at the 
position X{t) G Dg- . In this mode, there exists an admissible control u such that 
Ti, evolves continuously according to the dynamic: X{t) = f q.{X [t] , u{t)) . As 
soon as the system reaches a guard at time ti+i, a discrete transition 

e = {qi,qi+i) G £ may occur from the mode qi towards the mode qi+i and we 
start again the same process from the position Xiti^i) in mode qi+i at time 
ti+i- Note that ti and ti^i are the respective entering and exiting times in the 
cell Dq^ . 
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2.3.2 Computation of hybrid data on the fly 

Let Xq be the current position in the state space M" of the hybrid system 
Ti, defined in the previous section. In this section we first present an explicit 
algorithm to compute on the fly the current discrete mode associated to the 
position Xq i.e. the cell Dq which contains the point Xq- We will assume in 
particular that the hybrid automaton is not Zeno (no infinite number of switch 
can occur in a finite time, see e.g. [27, §2.3] and [46]). Then we want to compute 
the local control constraints induced by each cell Ag' in the mode q. 



Computation of the current state The principle of the algorithm is based 
on the definition of the implicit mesh A of the state and control space R" x Um 
(see section 2.2): we first compute the n-cube and then the simplex inside this 
cube that contain the given position Xq. 

Computation of the n-cube C such that Xq € C: according to the definition 
1, the n-cube C is defined by one origin point and the length h of its edges. 
Here the origin of the mesh A has to be chosen at the origin of M" x Um, 
so that the searched cube C can be written as: C = kh + [0, h]" , where kh = 
{kih, . . . , knh) £ hZ". Since we want to compute all the cubes C to which 
Xq = {xi, . . . , Xn) G belongs, we necessarily obtain: 



Vi e {1, . . . ,n}. 



if a-j ^ /i2 



fciGj — — 1,— } otherwise 
h h 



Computation of the simplex Dg such that Xq G Dg: let C = kh + [0,/i]" be a 
n-cube containing Xq. According to the proposition 4, C can be divided into 
n! simplices defined by: 

e Sn,D^ = {X e R" ; < - fc^(i)/i < . . . (6) 

• • • < X^(^n-j — k^(^n)h < h} 

To avoid the explicit computation of all the contained in the cube C, the trick 
is to directly compute the permutation ip which defines the searched simplex: 

1. Form L :~ [xi — kih, . . . , a;„ — knh]; 

2. Apply a sort by selection algorithm to the list L. We so obtain the sorted 
list L and the permutation ip that performs the sort on the list L. 

3. Deduce the simplex satisfying: Xq G Dip 



Computation of the local constraints on the control We consider a 
given cell Dq in the state space and again JC{q) = {q' £ I / p^„{Aq>) ~ Dq} 
is the set of cells in A whose projection in M" is exactly Dq . By construction, 
in each cell Aq, the control depends on the state. In this paragraph we want 
to compute the explicit control constraints. For that matter, we define Uq'{X) 
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to be the set of control constraints in the cell A,/ for a given X E R": VX S 
Dq, \/q' e IC{q), Uq.{X) = {u e / {X,u) S A,/}. The problem now is to 
determine the geometrical structure of Uqi{X) in M™. For instance, on figure 
4, we can remark that the control domain Uqi{x) is a segment (namely an 1- 
simplex) in R when x €]afc,afc + h], and is reduced to one point {u^}, when 




(a^, u.) (aj+h, u.) 



e A,/ 

X e Dq = [flfe, flfc + /l] 

and 

u e Uq'{x) = {m G Ui; 0<M — Ui<a; — Ofe} 



Figure 4: Definition of and Uq' {x) in cell A^/ for n = m = 1 

Let us take X e and g' G IC{q). By definition. Ag' is a simplex in 
R""*"™, so that it can be defined by a system of (n + m + 1) affine independent 
inequahties: MY + d>0 where M e M„+„+i,„+„i(R) and d € R"+™+i. We 
now define the left and right parts of M as follows: 



Ml = M 











so that: M = [ Mi | Ah ]. Thus we have V(X, li) G A,-, MiX + A^u + d>0. 
And we hence obtain a characterization of ?7o' : 



yx e Dq,Uq4X) ^{ue R'"/(x,m) e A,.} 

= {u e R^/MzM + {AIiX + d) > 0} 



(7) 



Therefore Uq' (X) is determined by a finite number of affine inequalities and is a 
polyhedral set. Moreover, since A, is bounded by construction, it follows that 
Uq' {X) is a bounded polyhedral set, i.e. a polytope. Furthermore, it also has 
the following property: 

Proposition 7 For all X E Dq. Then, for all q' E IC{q), Uq'{X) is a m-simplex 



The proof is given in appendix A ; the idea is to count the number of intersections 
between the whole control space and the n-faces of the simplex A^/ . 

In this section, we have proposed a model of hybrid automaton for the ap- 
proximation of nonlinear control systems X ~ f(X,u). The construction of 
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the hybrid model is performed on the fly, so that at any time we are able to 
determine the state and the associated local control constraints. Next section 
addresses the controllability problem, namely the existence of (non compulsorily 
optimal) solutions via this hybrid approximation. 



3 Approximation of the controllable domain 

In this section, we focus on the controllability of the nonlinear system (1) to- 
wards a given polyhedral target. For a given initial point Xq G in the state 
space, we want to study the existence of admissible trajectories of the system 
that steers Xq to the considered target at some finite time. The theory of lin- 
ear control systems without constraints on the control have been extensively 
developed and provides some powerful results like the Kalman controllability 
criterion [31, 32] for autonomous linear systems like X = AX + Bu. The use of 
this criterion can be extended to nonlinear control systems by using local con- 
trollability techniques an so provides local controllability results [33, 44] . Some 
global results can be obtained for a class of systems which depends afhnely on 
the control [30, 34]. 

Our goal is to propose a constructive approach of nonlinear controllability by 
the study of the controllable domain (the set of initial points that can be steered 
to the target). We thus present now methods and algorithms to approximate the 
controllable domain of nonlinear systems by the way of the hybrid model defined 
in section 2.3. We first present our hybrid approach for the controllability of 
nonlinear systems: the nonlinear controllable domain is approximated by the 
hybrid one. Then, after stating some useful properties of the hybrid controllable 
domain, we develop a new algorithm computing a convex approximation of the 
hybrid controllable domain. Some experimental results will then validate our 
algorithm. 



3.1 Hybrid approach to the nonlinear controllable domain 

In section 2, we have built a hybrid model approximating every nonlinear control 
system like (1). We now need to verify that the use of our hybrid model enables 
the approximated solving of the initial control problem ; namely we analyze the 
approximation of the nonlinear controllable domain by the hybrid one. Let us 
specify the notion of controllability and controllable domain to a given target 

Definition 3 X^ G R" is controllable to Tf if and only if there exists T > 
and u : [0,T] U,„ measurable, such that the following system (S) admits a 
solution X{.) over [0,r]; 

/cJ Xit) = f{X{t),u{t)) 
\ X{0) = Xo, X{T) e Tf 

The set of controllable points in M" to Tf is called the controllable domain of 
the system (1). 
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This definition can be applied to any control system, and particularly to the 
hybrid automaton previously described. Therefore, any Xq £ R" is controllable 
to the target 7} for the hybrid system H if and only if there exists an admissible 
hybrid trajectory, in the sense of definition 2, that steers Xq to 7} in some finite 
time T > 0. In particular, for any controllable point Xq G M", the hybrid 
system may accept several executions that steer Xq to the target, and several 
different cell paths q ~ {qi)i between Xq and Tf. 

3.1.1 Global error of the hybrid approximation 

Let C7vl(R") and C>^(R") be the respective controllable domains of the nonlinear 
system (1) and its hybrid model H towards a given target Tf. To validate the 
hybrid approximation, we need to evaluate the distance between Cml(W^) and 
CuiW^), namely their Hausdorff distance defined by: 

dH(CjvL(K"),C^,(M")) 

= max sup inf — 

\XGC„(R")'^eC«i.(K") 

sup inf \\X - Y\\ I 

yGCivL(K")'^6C«(»") / 

Let Xq e Cml{W^) be a controllable point to 7} by the nonlinear sys- 
tem (1). Hence, according to definition 3, there exist a finite time T > 
and an admissible control uq : [0,T] — > U™ such that the system: X{t) = 
f{X(t),u{t)), X{0) = Xq has a solution X satisfying: X{T) e Tf. Then we 
can easily build a new point Y/j^o that is controllable by the hybrid system Ti 
in the same time T and control uq as for Xq: Yhfl = Y{T) where Y is the 

unique solution of the system: Y{t) = -fh{Y{t),uo{T - t)), Y{0) = X{T) 
obtained from (1) by time reversal on the time interval [0, T]. Wc thereupon 
deduce: Yh^ e C^^(K") and infygc„(R") ll^o - Y\\ < \\Xo - Yh^o\\. According 
to the convergence results established in proposition 3, and assuming that / is 

e(h) 

Lipschitz in X, uniformly in u, we thus obtain: \\Xo - Yh,o|| < -^(e^^ - 1), 

L 

where s{h) denotes the interpolation error between / and its piccewise affine 
approximation fh ; wc consequently have shown that for all Xq e Cnl(M-^^)' 

3T>0, inf ||Xo-r|| < ^(e^^-1). 

Moreover, starting from a controllable point of the automaton 7i, wc prove in 
the same manner that: yYh,o e Cw(K"), > 0, infxec«i.(R") 11^ - ^oll < 

^(e^^--l). Since the final T is a priori unbounded, we introduce the notion of 
L 

controllable domain in time less than or equal to T > 0, denoted by Cjvl,[o,t] (R") 
in the nonlinear case and C-^ [q.t] (IR") in the hybrid one. We have shown: 
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Proposition 8 Let the nonlinear function f : M" x U„i M" be Lipschitz 
continuous in X ^ R", uniformly in u £ U™. Then: 

e^^ - 1 

dH (CArL,[0,T](K"),C«Jo,T](R")) < e{h)^—, 

where the interpolation error e{h) satisfies: lim e{h) ~ 0. 

Using the second part of the proposition 3, it is then straightforward to gener- 
ahze this result as: 

Proposition 9 Let f : M" x U,„ —>■ M" be Lipschitz continuous over any com- 
pact set in X Cz M", uniformly in u G 1U,„. Then there exists Lq > such 
that: ^ ^ 

dH (CArL,[o.T](K"),C«,[o.T](K")) < en{h f " ~\ 

where 57 C K" is the compact set defined by the union o/ C7vl.[o,t](II^") o,''^d 
Cw,[o,T](K"). 

3.1.2 Exploration reduction: controllable domain in finite paths of 
modes 

We have shown that the hybrid controllable domain is a valid approximation 
of the nonlinear one. From now on, we consider that a given point Xq in the 
state space is approximatively controllable by the initial nonlinear system (1) 
to the considered target if it belongs to the controllable domain of its hybrid 
approximation. Unfortunately the controllable domain of a hybrid system, like 
its attainable set, are rarely computable. One of the main reasons is that the 
number of discrete transitions needed to reach some modes of the hybrid system 
is unbounded [27, chapter 7]: by time reversal, the computation of the control- 
lable domain comes down to the computation of the attainable sets from the 
target. Then, if the given point Xq is "far from the target", all the admissible 
paths of cells between Xq and the target have to be explored. Hence an infinite 
number of discrete transitions are to be simulated and the computation can not 
be done in finite time. Therefore an approach that is more favorable to the 
simulation is required. We first assume that: 

{A2) The target Tf is a subset of some cell Dq in the 
state space W\ 3g e Q,Tf C Dg 

The assumption {A2) is non restrictive: indeed, if there are several cells Dq 
whose intersection with the target 7/ is non-empty, then Tf can be decomposed 
into the union of its intersections with each cell: Tf = UgeQ/_D,nr/7t0 (^9 ^ ^f) ■ 
There, we have several targets to reach (as many new targets as intersections) 
each of those satisfying the hypothesis (^2)- 

We first consider a given mode g G Q of our hybrid model Ti and a local 
target Tq (i.e.: Tq C Dq). We introduce the concept of controllable domain in 
the given mode q: 
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Definition 4 (Controllable domain in a mode q) Let q E Q be a given 
discrete mode of our hybrid model Ti. and Tq a subset of Dq. Xq £ Dq is 
controllable to Tq in mode q if there exist a finite time T > and an adm,issible 
control u such that the hybrid problem: 

X{t)^ fh{X{t),u{t)), X(0) = Xo, X{T)(,Tq 

has a solution, denoted by X^, satisfying: \/t e [0,T], Xh{t) G Dq. 

The controllable domain in mode q is the set of all the controllable points in 
mode q and is denoted by CuiTq, Dq). 

Hence, any point in Dq from which the local target 7q is attainable while re- 
maining within the cell Dq, necessarily reaches 7qi D Tq ; hence: 

Lemma 5 Tq c Tq, c Dq ^ Cn{Tq,Dq) c Cn{Tq,,Dq) 

The definition 4 can be easily extended to any subset of the state space, and 
especially to the set Q.^ associated to 7 = {qi)i=i...r- This set, shown on figure 5, 
is defined by fJ-y = IJl=i where 7 = (<?i)i=o...r is a finite sequence of adjacent 
modes of our hybrid automaton Ti. satisfying Vi = l,...,r— 1, Dq. H Dq.^.^ 7^ 0. 




Figure 5: Definition of the domain fl^ along the path 7 = {qi)i=o...s 



Let be a given subset of flj and the target for the controllability problem. 
We introduce the set C-H{Tf,^j) of points that can be steered in finite time by 
the hybrid model Ti. to the target Tf while remaining within f2^, in the given 
sequence of modes. Our idea is to build an approximation of the controllable 
set inside ft^, recursively defined along the path. Assuming without loss of 
generality that: Tf C i'qo, the principle consists in starting from the cell Dq^: 
we first compute the local controllable domain in mode qo. Then its intersection 
with the current guard G(^qg^q-^-) defines a new target now in mode qi and we 
pursue the approximation the same way (the spreading rule is illustrated on 
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figure 6). The algorithm stops when the last cell is reached or when the 
intersection between the current controllable domain and the next guard is 
empty. 




Fi gure 6: Spreading rule for the controllable domain from mode qi to mode qi+i 



This induces a recursive definition of an under-approximation of the hybrid 
controllable domain inside f2^: 

Proposition 10 We define: C^(O^) = CH{%,o,Dqo) U C-h (Xji , -Dqi ) U ••• U 
CniTq-r J Dq^), where CuiTqi , Dq^ ) denotes the set of points inside Dq. from which 
the local target 7q. C Dq. is attainable in finite time while remaining within the 
cell Dq. . The current target 7q. is recursively defined by: 

\ n+i=Cnin,DqJnG(^q.^q^_^^), i = 0,...,r-l 
C_f^{fly) is an under-approximation of Cui^'y), hence : C^(r2^) C Cni^^)- 

Proof. We proceed by induction. Let us define for i = 0, . . . , r, the property 
V{i) : C«(r,„,i?,JU---UC«(r,^,L',J cc«(r!^). 

First, let G C-uiTqo^ ^go)- Since Dq„ C and Tq^ ~ 7}, the point Xq 
is actually controllable to the Tf while remaining within 17-^. This proves that 
Xo e Cu{%) and 7'(0) is true. 

Second, let us assume that the property 7^(1) is satisfied and Xq G IJl-t^o ^T-L^q 

By assumption, if G Ufc=o'--'>^('^i ' -^9fc) ^'^'^ ^0 G Cui^l-i^- We can thus 
assume Xq € C-HiTqi+n Dq.^.^). By definition of the local controllable domain 
inside the cell Dq.j_^, there exists Yq G ^i+i such that the hybrid system Ti 
steers Xq to Yq while remaining inside Dq.^^^ and henceforth inside Q,^. More- 
over by construction: T^^+i = Gq^^q^^^ n Cni%]i, Dq.), so that Yq is steerable to 
the target 7} while remaining inside fi^ . By transitivity, we conclude that Xq is 
controllable to the target 7} while remaining within Qj: Xq G Cu{Tf i^-y) and 
the property V{i + 1) is satisfied. □ 

The decomposition of the controllable domain in f2^ in local controllable 
sub-domains expressed in proposition 10 draws the structure of the algorithm 
presented in section 3.2 for the hybrid controllable domain computation. Actu- 
ally, as shown next, it is also possible to use this spreading in different directions. 
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3.1.3 Expansion of the controllable domain in r iterations 



In paragraph 3.1.2 we have proposed a constructive approach of the hybrid 
controllable domain to a given target Its key points are described by the 
proposition 10. In practise, this process may be used in two different manner 
illustrated on figures 7 and 8. 




Figure 7: Controllable under-approximation in a given finite cells path 7 = 
(gj)i=o,...,8- 

The first is an extension of the process described previously: we are given a 
finite sequence 7 = (gi)i=o,....r-i of adjacent modes of the hybrid model 7i and 
we compute the controllable under-approximation as expressed in proposition 
10 and illustrated on figure 7. A weakness of this method is the arbitrary choice 
of the path in which we want to compute our controllable under-approximation. 
As we will see later a way to exhibit such a path is to start with a cell large 
enough to contain both the source and the target. Then such an approximation 
at a given level could guide the selection of an interesting path at a more refined 
approximation level. 



Figure 8: Controllable under-approximation in 3 iterations from the target Tf. 

The second idea is based on a more global approach of the hybrid control- 
lable domain. There, we compute an under-approximation of the global hybrid 
controllable domain Ch{W^) by time reversal starting from the cell containing 
the target Tf. The difference is that we only impose a maximum distance to 
the target. The principle is illustrated on figure 8: we compute the cell Dq„ of 
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the state space containing the target (7/ C Dg^). Starting from Dq^ by time 
reversal, we compute the local controllable domain to 7} in this cell and then 
its intersections with all the guards (i.e. facets) of Dq^. Each non empty inter- 
section defines a new target and a new cell at the next iteration. This process 
is pursued at the p-th iteration from the controllable domains computed at the 
[p — l)-th iteration. 

3.2 Computation of a convex approximation of the hybrid 
controllable domain 

We have studied the approximation of the controllable domain to a target Tf 
of a nonlinear system by the controllable domain of its hybrid approximation. 
In view of simulations, we also developed a more constructive approach for 
the hybrid controllable domain computation. This approach is based on local 
controllable domains propagation and reduces the state space exploration. Now, 
we want to compute the set of controllable points (or attainable points by time 
reversal) in R", i.e. the set of initial points that can be steered to a given target 
Tj in finite time. The considered class of systems is the one of piecewise affine 
hybrid systems defined in section 2.3. 

A first method for the attainable sets computation consists in extracting, 
by abstraction, a simplified model equivalent to the initial one for the regarded 
property [2, 3, 43] . The main problem is to find the right abstraction that enables 
to conclude under a reasonable complexity. Another class of methods developed 
for safety verification is based on approximations by unions of polytopes. For 
instance, [4] compute an over-approximation of the attainable set and can thus 
certify that the system can not escape from an admissible set of states. On 
the contrary our idea is to guaranty as much as possible the controllability of 
a given initial point. In [22], we proposed an algorithm to compute an under- 
approximation in time T > of the controllable set without state constraints. In 
[39], an extension of this algorithm to piecewise affine systems under constant 
control constraints is presented. Here, these techniques are adapted to our 
hybrid model Ti. for bounded convex polyhedral targets. From now on, we 
assume: 

(A3) The target Tf is a bounded polytope o/M" defined 
as the convex hull of its vertices. 

3.2.1 Convex approximation within a given state cell 

Let us consider a given discrete mode q e Q of our hybrid model, or cquivalcntly 
a given cell Dq in the state space. Let Tg C Dq be a local target in mode q 
satisfying the hypothesis (A3): Tq = Conv{sg^i^ . . . , Sg^k)- In this paragraph, we 
address the problem of approximating the local controllable set C-H{Tq, Dg) i.e. 
the set of points in Dq steerable in finite time to 7^ while remaining within 
Dg. Our main idea is to compute some specific controllable points in Dg ; their 
convex hull defines a convex approximation of the hybrid controllable domain 
in mode q. 
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Now, any point Xq in Dq is controllable to a target Tg in mode q if it is 
attainable from Tq by time reversal. Consequently, a simple way to compute 
controllable points in a mode q is: for an admissible control Uj, we compute by 
time reversal the hybrid trajectories X[sq^i^Uj] coming from the vertex Sq^i of 
Tq. We then introduce their intersections, when they exist, with the boundary 

of Dq -. 

X,,j^X[Sq^„Uj]iT,) (8) 

where = sup{i > 0; X[sq^i, u]{t) G dDq}. As illustrated on figure 9, when the 
trajectory X[sq,i, Uj] remains inside Dq on a non empty time interval, all points 
between Sq- and the corresponding intersection Xij is controllable to the target 
Tq. Our approximation will be the convex hull of all the points computed this 
way. 




(a) (b) 

Figure 9: Principle of the convex approximation in a state cell Dq: (a) along 
the trajectory X[sq^i,ui], only points between Sq^i and Xi are controllable to 
Sq,i & Tq (b) Convex approximation 



The main difficulty in this approach is the choice of control functions u. 
Indeed they have to be simple enough to enable the efficient computation of 
the related trajectories, but also significant enough to obtain a good quality 
approximation of the hybrid controllable domain. According to the construction 
of the mesh A = (Aq/)g'g/ of the state and control space developed in section 
2.2, there exists a column of cells {A^/ ; q' G ^{q)} in M" x Um whose projection 
on the state space R" is exactly Dq. We recall that the cells of this column are 
defined by their indices this way: JC{q) = {i € I \ p^„(Ai) = Dq}. By definition 
of our hybrid model, in each cell A^' of this column, the hybrid dynamic fh is 
affinc. Our idea is then to choose a control function such that the control and 
state trajectory remains within only one cell A^/ (g' G /C((?)) while 

X[t) remains within the state cell Dq-. 

ar > 0, Vt e [0,T],X(i) e and (A:(i),u(t)) e A,/ (9) 

Wc thus ensure that the hybrid dynamic is affinc over the time interval [0, T]. 
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Our method is then to consider feedback affine controls: 



u^{t)^FjX{t)+gj 



(10) 



such that {X{t),u{t)) evolves on the edges of our state and control mesh A. 
Such controls at the position X{t) correspond to the vertices of the control 
domains Uqi{X{t)) defined in paragraph 2.3.2 and their definition is illustrated 
on figure 10. 



Control domain 



u-,XX{t)) 




State space 



Figure 10: Control Ui{t) chosen for the computation of a convex approximation 
of the hybrid controllable domain in mode q 



The principle of our approximation algorithm 1 in mode q is as follows: for 
each previously defined control Uj and each vertex Sq^i of the target 7^, 

1. Test if the trajectory by time reversal Xlsq^i^Uj] remains inside Dq on 
a non empty time interval (notice that for this test, wc do not need to 
exphcitly compute the trajectory, see [40, chapter 5, paragraph 5.1.2]). If 
not, stop. 

2. Computation of X[sq^i^Uj\ by solving: X{t) ~ AqiX{t) + BqiUj{t) + Cq', 

i.e. X{t) = (Aq, + Bq,Fj)X{t) + {Bq,gj + Cq'). 

3. Computation of its first intersection with the boundary of Dq. 

Let A(7^, Dq) be the convex hull of the so computed controllable points Xi j 
defined by (8) and computed by the algorithm 1. By construction, we deduce: 

Proposition 11 A{7q,Dq) is a convex approximation of the hybrid controllable 
domain C-ui'Tq, Dq) in mode q. Moreover if Cu{Tq, Dq) is convex, A{Tq,Dq) is 
a guaranteed under-approximation; 

Cn{%i,Dq) convex =^ A{Tq, Dq) d CniTq, Dq). 
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Algorithm 1 ConvexApproximation: Convex approximation of the hybrid 

controllable domain in a mode 

Require: H, q a discrete mode of the hybrid automaton H, Tq = 

Conv(sq^i, . . . , Sq^k) thc local target in mode q. 
Ensure: A convex approximation of thc controllable domain C-h{%}, Dq). 
1: Initialization: A := 0; 

2: {Computation of the indices q' e !^{q) of cells A^/ whose projection is 
exactly Dq :} 

ICiq) := {q' & I / p^^Aq,) ^ Dq} 
3: {Set of already tested controls:} S := 0. 
4: for all q' e IC{q) do 

5: {Computation of the polytope Uq>{X) of control constraints (see §2.3.2):} 

Uqi{X) :~ ControlConstraints{Aq'); 
6: {For each non tested vertex of Uq>{X),} 
7: for all i from 1 to card{Uq'{X)) such that Uq'{X)[i] ^ S do 
8: {i-th vertex of S:} u := t ^ Uq'{X{t))\i]; S := EU {u}; 
9: {For each vertex Sqj of the target 7^,} 
10: for all j from 1 to k do 

11: if the trajectory X[sg j, u] remains in Dq on a non empty time interval 

then 

12: {Computation of the intersection of X[sq,j, u] with dDqi} 

{Xf, ExitFace) := OutCeW {7i^ q, Sq,j, u); 
13: {Xf, a it exists, is controllable,} 

14: if X/ 7^ then A := AU {X/}; end if 

15: end if 

16: end for 
17: end for 
18: end for 
19: Return Conv(A). 

t (Xf, ExitFace) := OutCellCH, q, Xo,u) computes the intersection point ^/ of the trajectory 
X[Xo,m] of the hybrid automaton 7i with the state cell Dq. 
satisfying: Xf G ExitFace. 
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3.2.2 Expansion of the convex approximation and approximation er- 
ror 

Let us consider a finite sequence of adjacent discrete modes of the hybrid au- 
tomaton Ti. satisfying Vje{l,...,r- — 1}, D^. n D^.^^^ ^ and Tf C Dg^ . As in 
section 3.1.2, we introduce the set fly associated to the path 7 and defined by: 

r 

= y Dq. . According to proposition 10, the hybrid controllable domain in fl^ 

i=0 

can be under-approached by the union of local controllable domains successively 
defined in the cells Z?^. of the path 7: 

C^in^) = Ch{To, Dg,) U Cn{ri,Dg,)\J . . . 

UCni%,Dg^) cCnii^^) (11) 

where Tq. denotes the local target in mode qi. Similarly, wc define an approxi- 
mation A(f2^) of Ci-^{Q-y) by successive local convex approximations: 

A{ny)=A{To,Dq,)UA{Tl,Dq,)U---UA{Tr,Dq^) (12) 

where A(ri, Dq.) denotes the convex approximation computed by the algorithm 
1 and (Ti)i=o...r is the sequence of local targets defined along the path 7 by 
recurrence as follows: 

/ ^0 = (13) 

\ T,+i = A(T,,D,JnG(,,,5,^,), i = 0,...,r ^ ^ 

We easily check that the targets r^, i = 0, . . . , r are polytopes (i.e. bounded and 
convex polyhedra). Furthermore, each local target Ti is a convex approximation 
of Tq., so that A{Ti, Dq.) is actually a convex approximation of the controllable 
domain Ch (Tq. , Dq. ) : 

Proposition 12 For alii = Q, ... the local targets ti defined by the recursive 
relation (13), are respective convex approximations of targets Tq^ and A{Ti, Dq.) 
is a convex approximation of order ^/nh of the controllable domain CuiTi, Dq.) 
in mode qi. 

Moreover, if the sets C-h {Tq. , Dq. ), i ^ 0, . . . ,r, are convex then: Vi = 0, . . . , r, C 
Tq. and A{Ti , Dq^ ) c Cn {Tq. ,Dq.). 

Now, we want to evaluate the approximation error, namely the HausdorfF dis- 
tance dff (C-H(ri^), A(f2^)) between the controllable domain C-H(ri7) and its piece- 
wise convex approximation A{fly). Since C-j-i^{n.y) is an under-approximation of 
CTi,{^'y) and according to the decompositions (11) and (12), we deduce: 

dH{Cn{ny),A{ny)) < dH{CH{ny), A{ny)) 

< max dH {Cn {Tq^ ,Dq^), A(r, ,Dq^)) 

i—0...r 

Moreover, for any cell , i ~ 0, . . . , r, the distance of the local controllable 
domain CH{Tq., Dq.) to its convex approximation A{Ti,Dq.) is bounded by the 



24 



size of the mesh 2? of the state space. Hence, according to 5, we have: Vz = 
1, . . . , r, dH{Cn(Tqi, Dq^), A(Ti, DgJ) < ^Jnh and we can conclude by: 



3.3 Experience on the non linear spring 

In this section, we consider the smah example of the nonlinear spring presented 
in [44, paragraph 7.3.1]: 



where the control constraints arc: Vt > 0, \u{t)\ < 1. In this section, wc want 
to apply the previously described algorithms to compute a controllable approx- 
imation of the set of initial positions {xo,yo = ig), from which the spring can 
be steered to its equilibrium position (0,0). 

In this example, we perform the convex approximation algorithm on three suc- 
cessive adjacent cells. Figure 11 shows the three successive hybrid controllable 
approximations for the discretization step h ~ 1. Figurc3.3 shows the result of 
our algorithm at three different discretization level h € {2, 1, i}. The outside 
blue curve delimits the real nonlinear controllable domain over the three con- 
sidered cells: as we can observe, the hybrid approximations tend to the exact 
nonlinear domain when h decreases. 




Figure 11: Successive controllable approximations inside three adjacent cells for 

h = l. 



3.4 Orbital transfer 

We now consider a satellite in orbit around the Earth. Our control problem is the 
minimum time transfer of this satellite from an initial orbit to a geostationary 
one. 

Let m be the mass of the satellite and u its thrust. The satellite dynamic is 




(14) 
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Figure 12: Convex controllable approximations at scales h — 2, h ~ 1 and 
h — 1/2. Our approximations tend to the exact controllable domain delimited 
by the blue curve. 
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described by the equations: 



m = 

where /i denotes the gravitation constant. The control u is assumed bounded: 
||u|| < Fmax- The numerical values of the involved physical constants arc given 
in Table 1. 



u 

m ' 



(15) 
(16) 



Parameter 


Value 




5165, 8620912 Mm^.h-^ 


Too 


1500 kg 


Fmax 


3 N 



Table 1: Physical constants for the orbital transfer problem 



The mathematical model of the satellite is a variable mass three dimen- 
sional model. Simplified models have been proposed such as coplanar transfer 
[14] and/or constant mass models [12]. For each model, geometrical studies have 
been proposed to analyze the controllability properties and the structure of ex- 
tremals. For the sake of simplicity, we restrict our analysis to coplanar transfers 
at constant mass. The satellite dynamic is described by equations expressed in 
radial-orthoradial coordinates (P, e^,, e^, L) as follows: 



" p ' 





















sin L 


Cy 


V p 







—cos L 


L 













2PIW 
cos L + (ej; + COS L)/W 
sin L+ {cy + sin L)/W 




Ul 



U2 



(17) 



where: W ~ l + e^cos L + Cysin L. Variables (P, e) are respectively the parame- 
ter and the eccentricity vector of the osculating ellipse and L the longitude. The 
variable change between the Cartesian and radial-orthoradial frames is defined 
by: 



ri 



r2 



P 



W 



-sin L 



V2 




L) 



(18) 



cos L) 
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where v = r denotes the satelhte velocity. From now on, the control constraints 
are expressed as: 

|wi| + |"2| < Fynax- 

and the control domain U„i could be defined by: U,„ = Conv {[-Fmax-, 0], [0, -F^ax], [Fmax, 0], [0, F^ax]) ■ 

The control problem is to minimize the time transfer of the satellite from 
a given initial orbit to a given final one, the position of the satellite on this 
last orbit being free. The related initial and final conditions arc summed up on 
Table 2. 

Initial conditions Final conditions 

= 11.625 Mm Pf = 42.165 Mm 
e° = 0.75 4 = 

e« = el = 

= TT rad Lf free 



Table 2: Initial and final condition for the orbital transfer problem 



Let us now consider an initial point Xq = [11.625, 0.75, 0, tt] expressed in 
orbital coordinates. At the rougher scale h = 6, our algorithm 1 computes 
451 controllable points in 4 minutes, building a four-dimensional approximation 
made of 104 vertices. By projection, we could obtain for example: 

• the attainable set of the eccentricity e from the initial point cq = (0.75, 0) 
(see figure 13- (a)). 

• the three dimensional attainable set of parameters {P,ey,L) (sec figure 
13-(b)). 

Using variable change (18), these numerical results could be translated in Carte- 
sian coordinates (ri,r2) and give an approximation of the attainable set from 
Xq into the Earth Cartesian frame (see figure 14). 

Observe that since the Earth radius rgis about 6 kilometers, collision with 
the Earth is possible. To avoid this configuration, we could add a constraint like 
||r|| > rs- we would then have to redefine the involved cells of the state mesh 
and this way, to restrict the controllable domain. 

4 Approaching nonlinear optimal solutions by 
optimal solutions of the hybrid control prob- 
lem 

In this section we focus on the resolution of the nonlinear optimal control prob- 
lem via the hybrid approach. Let Um be a convex and compact polytope in M™ 
such that: G U„j: in section 2.3, we shaw that any nonlinear control system 
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(a) 




km 



Figure 14: Approximation of the satellite attainable domain at scale ft, = 6 in 
the Cartesian frame (ri,r2) 



like (1) ean be approached by a piecewise af&ne hybrid model. This enables us 
to define two optimal control problems related to each control system: 

(Vnl) Minimize the cost function J{Xo,u) = J^^ l{X{t),u{t))dt with respect 
to the control u under the nonlinear dynamic: 

x{t) = f{x{t),u{t)) x(t\^r, 

and the constraints: G [0,tf],u(t) G Um- The final time tf is unspecified. 

(Vh) Minimize the cost function J(X(),u) = J^^ l{X(t),u(t))dt with respect 
to the control u under the piecewise affine hybrid dynamic: 

x{t) = f^{x{t),u{t)) x(t)^r 

and the constraints: Vt G [0,tf],u{t) G Vm- The final time tf is unspecified. 



As for the controllability problem, a key point of our hybrid approach is first 
to prove that solutions of the hybrid problem {Vh) are satisfactory approxima- 
tions of those of (Vnl)- We then address the problem of solving the hybrid 
optimal control problem {Vu)'- a powerful tool for optimal trajectories analysis 
is the (PMP) Pontryagin maximum principle [37]. First, in section 4.1, we state 
a suitable maximum principle for our hybrid optimal control problem {V-h) a-nd 
then deduce in section 4.2 the general structure of hybrid optimal trajectories. 
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4.1 Hybrid rriciximum principle 



In the classical context of smooth systems, the Pontryagin maximum principle 
provides first-order necessary optimality conditions and a pseudo-Hamiltonian 
formulation of the considered optimal control problem. It has been first stated 
by Pontryagin and al. in [37]. Stronger versions have been stated especially in 
the context on constraints on the control. References are numerous ; one can 
refer to [37, 33, 18, 1] for a proof of the maximum principle or to [11, paragraphe 
1.5] and [44] for a detailed study of its applications. 

The main difficulty of establishing a hybrid maximum principle is to manage 
discontinuities generated by the discrete component of the hybrid system. Today 
several formulations has been proposed depending on the structure and the 
regularity of the studied hybrid systems [42, 45]. Especially P. Riedinger, C. 
lung and F. Kratz state and prove a hybrid maximum principle for a general 
class of hybrid systems in [38, theorem 2.3]. Next our hybrid maximum principle 
is inspired from their theorem. 

Theorem 1 (Hybrid maximum principle) We introduce the Hamiltonian 
Hh related to the hybrid model 7i defined in section 2.3: 

Hh{X,u,\) = l{X,u) + \' UX,u). 

If the control u is optimal on the time interval [0,i/], then there exists a piecewise 
absolute continuous application A : [0,i/] —^ M" such that: 



For almost all t g [0,i/], H reaches its minimum with regards to the 
control at point u{t) i.e.: 

Hh{X{t),u{t),X{t)) = min Hh{X{t),v, X{t)). 
. Xit)^^iXit),u{t),\{t)) 

• A(t)^ = — - — -{X{t),u{t)^\{t)) (Euler- Lagrange equation) 

oX 

• Hh{X{t),u{t), X{t)) = along the optimal trajectory. 

• At each transition instant ti between two modes Qi and Qi+i, we have the 
following transversality conditions: 

A(t+)-A(t7) 

Hh{X{t+), u{t+), X{tt)) = Hn{X{t-l u{t-),X{t-)) 



We first recall that by definition, the state variable is continuous at each 
transition time ti: X{tf) = X(t~), which together with the autonomous feature 
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of the hybrid system, implies the contimiity of the hybrid Hamiltonian at each 
transition time. 

Then transitions between discrete modes of our hybrid automaton are non 
constrained: at each transition time ti between (at least) two modes qi and 
Qi+i, our hybrid system could either switch to mode gi+i, or stay in mode 
Qi, depending on the value of the field vector fh at X{ti). This implies the 
continuity of the adjoint vector A at each transition time. 

4.2 Characterization of hybrid optimal trajectories 
4.2.1 Necessary optimality conditions in mode q 

Let Xq e M" be a controllable point by our hybrid system H and q the corre- 
sponding discrete mode satisfying: Xq e Dq. We assume that the initial time 
t = is not a transition time of the system, which means that Xq does not 
belong to any guard of H (in other words Xq e int Dq). This assumption guar- 
antees that any (state) trajectory steered from Xq remains inside the cell Dq 
on a non empty time interval. 

Our point is now the local resolution (inside Dq) of the optimization problem 
stated by the hybrid maximum principle. According to this principle, any state 
and control trajectory candidate to optimality, also called extremal, in mode q 
has to fulfill three conditions: to minimize the hybrid Hamiltonian, to enable the 
definition of an appropriate adjoint state vector A and to cancel the Hamiltonian 
along this trajectory. 

Minimization of the hybrid Hamiltonian Hh in mode q In paragraph 
2.3.2, we defined the set of local control constraints at a given position X in one 
mode q by introducing the sets: 

yx e Dq,yq' e IC{q), Uq,{x) = e U™; {x,u) e A,,}. 

where lC{q) denotes the set of indices of state-control cells of the mesh A whose 
projection is exactly Dq. Therefore (C/g' (^))q'GA:(g) forms a partition of the 
control domain Um i-e.: 

VXei?„U™= y Uq^X) (19) 

q'eKiq) 

Now, by definition of the hybrid system 7Y, between two successive transition 
instants ti and ii+i, the state trajectory X{t) remains in cell Dq, the state and 
control trajectory {X{t),v) belongs to A^/ for all v € Uqi{X{t)) and the hybrid 
dynamic at any point {X{t),v) for v € Uqi{X{t)) is affine: V< G [t,;, t^+i], Vw G 
Uq'{X{t)),fhiX{t),u{t)) = Aq,X{t) + Bq,v + Cq- . Together with (19), we then 
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have: 

inf H{X{t),v,X{t)) 



= min I min H (X (t) , v , X(t)) 

q'eK{q) \veu^,{x{t)) 

= min ( min l{X{t)) + X{ty h(X{t),v) 
= .rr'hm^ + HtVA,,X{t) 

+ X{tYcq, + X{tyBq,V 

Let us consider the inner minimization problem for a given q' G IC{q). The 
objective is afiine in the optimization variable v. Moreover, according to the 
proposition 7, the set Uq'{X{t)) is a m-simplex whose vertices are denoted by 
ai{t) — FiX{t) + gi, i — 1, . . . ,m + 1. The induced constraints on the control 
V are therefore affine and the inner minimization problem actually is a linear 
program whose solution is well-known: 

v*{t)^F,X{t)+g^ 
if yj ^ I, X{tYB,, m - Fj)X{t) + g, - gj] < 0. 

We observe that the switching conditions explicitly depend both on the state X 
and the adjoint state A. 

Remark 3 (Switching curves and singular controls) From now on, we as- 
sume that optimal controls are non singular or equivalently, that zeros of each 
switching function 

5,,, : t ^ XitYBq, [{F, - Fj)X{t) + g, - g,] 

where i, j = 1, . . . ,m,i j , are isolated. 

Consequently our minimization problem in mode q could be replaced by the 
simultaneous solving of a finite number (exactly card JC{q)) of linear programs: 
the searched optimal controls are feedback (more precisely they are piecewise 
afhne in the state variable X). Notice that the choice of each piece FX + g also 
depends on the adjoint state variable A. 

The next step is to check that such controls fulfill the missing optimality 
necessary conditions given by the hybrid maximum principle. Let us now intro- 
duce a given affine feedback control u = FX + g and the resulting trajectory in 
mode q. As illustrated on figure 15, we now focus on a time interval [toi^i] on 
which the considered trajectory {X, FX + g) remains in a state-control cell Agi 
for some q' € IC{q): 

yt e [to,ti],{X{t),FX{t) + g) e A,,. 
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.u = FX + g 

• u ~ FX+g a given affine feedback 
control. 

• X the resulting trajectory solu- 
tion of: 

X(t„) Xit,) X, Xit)^MXit),FXit)+g). 
Figure 15: Definition of an affine feedback control u = FX + g in mode q 




It follows that the hybrid dynamic is affine along the trajectory {X, FX + g) G 
Aqt i.e. for all t S [ioi^i]: 

h{X{t),FX{t) + g) = {A,, + B,,F) X{t) + Bg,g + c,, . 

Definition of the adjoint vector A in mode q The adjoint parameter is 
then simply defined as solution of the following first order differential equation: 



dl 
dX 



iX{t))-XitfiA + BF) 



(20) 



for all t €]io,ii[- According to our regularity assumptions, the cost function I 
is continuously differentiable on R" x Um, so that 1 1-^ -^{X{t)) is continuous. 
This implies the expected absolute continuity of the solution A. Notice that no 
initial condition on A is required. 



Hybrid Hamiltonian equal to along the optimal trajectory Lastly 
we have to check that the hybrid Hamiltonian is equal to along the considered 
trajectory {X, FX + g). We write for all t <E [to, ti]: 

H{X(t),FX{t)^ + g,X{t)) 

= liX{t)) + X{ty Ua + BF)X{t) +Bg + c 

= liX{t)) + Xit)'^X{t). 
Using X{t) ^ {A + BF)X{t) and relation (20), we prove 

^HiXit),uit),Xit)) 

= —iX{t))X{t) + A(t)^X(t) + Xit)'^X{t) 

= -^{X{t))X{t) + X{ty{A + BF)X{t) 

^{X{t)) + X{ty{A + BF)^X{t) 
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for all t E [to,ti]. That means that the Hamiltonian is constant along the 
considered trajectory {X, FX+g, A). Thus to fulfill the last optimality necessary 
condition required by our hybrid maximum principle, we just have to cancel the 
hybrid Hamiltonian at the initial time i.e.: 

l{X{to)) + X{to)^ [{A + BF)X{to) + Bg + c] = 0. (21) 

This gives an explicit constraint on the adjoint state variable A at the initial 
time to- 

In conclusion that fcedback affine controls are good local candidates to op- 
timality. Next we propose a strategy to globally reconstruct the extremals of 
our hybrid optimal control problem (V-h). 

4.2.2 Global structure of hybrid extremals 

Let {X, u, A) be an extremal of the hybrid optimal control problem {V-h) for 
the initial condition X{0) = Xq. By definition 2 of an hybrid trajectory, there 
exists a sequence of time intervals ([tj, ti+i])i=o...r and a sequence of discrete 
modes {qi)i=o...r+i such that: 

Wte[U,t,+i],X{t)£Dq^ (22) 

The transversality conditions stated by the hybrid maximum principle together 
with the definition of our hybrid model, guarantee the continuity of both the 
state X and the adjoint state A at each switching time ti between two discrete 
modes. There just remains to determine the structure of trajectories inside each 
mode Qi. 

Structure in mode qi We focus on one discrete mode qi of our hybrid au- 
tomaton H and introduce the set E,. (X) of the vertices of the control domain 
Uqi{X) induced at position X in mode g^: 

E,,(X) = {FX + geVm/^q' (^IC{q^), 

{FX +5) is a vertex of Uq,{X)} C U^. 

According to results presented in paragraph 4.2.1, the optimal control u is locally 
the solution of a linear program and can be expressed as an explicit piecewise 
affine function of the state X, each piece's choice implicitly depending on the 
adjoint vector A: therefore there exists a subdivision (tj)j=o...Ari of the time 
interval ]ti, such that for all j e {0, . . . , A'^^ — 1}: 

3(F,,.gj) e M,„.„ X R™,Vt G [%,tj+il , . 

u{t)^FjX{t)+g,e^q^{X{t)). ^ ' 

Hence: {X,FjX + gj) E Aqi on In other words, as illustrated on figure 

16, the optimal trajectory {X, u) evolves along the edges of the state and control 
mesh A, continuously in X and not in u. 
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u 




Figure 16: Hybrid extremals evolve along edges of the state and control mesh 
A 



The main issue is so to efficiently compute both the optimal control and 
the adjoint state variable A at any given position X. Indeed Euler-Lagrange 
equations determining A depend on the control choice, which in turns depends 
on A. Therefore we need to determine u = FX+g and A satisfying the optimality 
conditions without having to compute any trajectories. 

Switching conditions and optimal control computation Let us consider 
our hybrid optimal control problem at a given time tj. The state Xj = X(tj) 
and the adjoint state Xj = X{tj) are supposed to be known. According to (23), 
we want to determine control candidate to optimality satisfying u = FX + g in 
the set 'Sq.{X). We also need to compute the first time tj+i > tj at which the 
optimality conditions are not fulfilled any more. 

We assume that the control u = FX+g satisfies all the optimality conditions 
required by the hybrid maximum principle at time tj: 

Hh {Xj , FXj + .g, Aj ) = min Hh {X, , u, Aj ) 

min HhiXj,F'Xj+g',Xj) 

F'xj +y 

Hh{Xj,FXj+g,Xj) = 

This induces that switching functions could be expressed via the hybrid Hamil- 
tonian this way: 

Spx+g.F'x+g' = A^ [MX, FX + g) - MX, F'X + g')] 
^-1{X)-X^ MX,F'X + g') 
= -Hh{X,F'X + g',X) < 

Therefore switching conditions are obtained by studying the zeros of the ap- 
plication: u 1^ Hh{Xj,u,Xj) over the finite set Yjq.{Xj). We thus compute: 
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Hh{Xj, FXj +g, Xj) for all FXj+g G T,q. (Xj). When the latter equals zero, the 
corresponding control is a good candidate to optimality. The main advantage 
of this approach is that wc only need card E^^ (Xj), i.e. at most m + card IC(qi), 
function evaluations of {Xj ,-,Xj). 
We then introduce: 

tj+i = min (t,+i; 

inf{t > t,;Hh{X{t),FX{t)+g,X{t)) > 0}) 

where {X, A) denotes the state and adjoint state trajectory steered from {Xj, Xj) 
at time tj according to the control u = FX + g. In other words, ij+i denotes 
either the exit time, namely i^+i, from the current mode qi or the first instant 
at which optimality conditions are not satisfied anymore. At time tj+i, we have 
new initial conditions (Xj+i, Aj+i) = {X{tj+i), X{tj+i)). 

4.3 Simulation algorithm of hybrid extremals 

For a given initial condition of the adjoint state, we propose an algorithm to 
compute extremals of the hybrid optimal control problem i.e. trajectories sat- 
isfying the necessary optimality conditions of the hybrid maximum principle. 
Algorithm 2 details this computation of a hybrid extremal in a given mode qi. 

The global simulation algorithm of hybrid extremals relies on the computa- 
tion of discrete modes reached by the trajectory satisfying the necessary optimal- 
ity conditions and on the transversality conditions that guarantee the continuity 
of the adjoint state variable A. The principle is as follows: starting from a given 
initial condition {to,Xo, Xq) in mode qo and using algorithm 2, wc compute the 
trajectory inside Dq^ satisfying the hybrid maximum principle and its exit time 
from the cell Dg^ towards an adjacent cell Dq-^ . By continuity of the state and 
adjoint state, the exit conditions give new initial conditions {ti, Xi, Ai) in mode 
qi and we pursue the simulation the same way. 

5 Conclusion 

In this paper, we address the problem of hybrid optimal control of nonlinear 
dynamical systems. First we propose a hybrid approximation, by way of a hybrid 
automaton with picccwise affinc dynamics, of complex systems. We therefore 
define a picccwise affine optimal control problem and then focus on two main 
problems in control theory: the controllability question and the computation of 
optimal solutions. 

For the controllability, we have given a convex approximation of the con- 
trollable set which is at a guaranteed distance of the actual controllable set. 
Furthermore, when the controllable set is convex, this approximation is an 
under-approximation and thus guarantees controllability. Wc then proposed 
and implemented an algorithm to compute this approximation and its efficiency 
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Algorithm 2 Extremal computation in a given discrete mode qi 
Require: ti > initial time in mode qi at the position Xi E Dq- for the given 
initial condition: X(ti) = A,;. 
1: Initialization, j = 0; to = ti', = Xi] Aq = A^; 

2: Control computation. Find the control FXj + g E Sg. (Xj) satisfying the 
necessary optimality conditions at time tj i.e.: 

HhiXj,FXj+g,Xj) - 
HhiX,,F'X,+g\X,) > 

for aU F'Xj + g' G S,. {Xj)\{FXj + g}. 
3: State and adjoint state trajectories. Compute the state Xj = 
X[Xj,FX + g] and adjoint state Xj = X[Xj,FX + g] trajectories respec- 
tively steered from Xj and Xj according to the control u{X) = FX + g: 

X{t)^ fh{X{t),FX{t)+g), X{U)^Xj 

X{ty = -^{X[X,,FX + g]{t), 

FX[Xj,FX + g]{t)+g,X{t)) 

X{U) = A, 

4: Compute the exit time ti+i of X[Xj, FX + g] from the cell Dg. and the next 
cell Dq.^-^ at time t^+i. 

5: Switching time. Introducing the Hamiltonian along the considered tra- 
jectory, compute 

tj+i = mi{t> tf,Hh{X[X,,FX + g]{t), 

FX[Xj,FX + g]it) + g, X[X„FX + g]{t)) > 0} 

6: if tj+i exists and ij+i < i^+i then 

7: Go back to step 2 with new initial conditions: ij+i, ^j+i = X[Xj, FX + 
g]{tj+i) and A^+i = X[Xj,FX + g]{tj^i) until tj^i ~ t^+i i.e. until the 
state leaves the cell Dq.. 

8: j =j + 1; 

9: end if 
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and accuracy are shown on the examples of the nonlinear spring and of the 
change of orbit. 

Then we addressed the question of optimality. By developing an hybrid max- 
imum principle we are able to characterize the hybrid optimal trajectories and 
to prove that the optimal control have the form of afhne feedbacks. Moreover, 
the principle gives us a way to compute the switching times between modes and 
thus an algorithm to follow the optimal trajectories. 

Further improvements include mixing this approach with shooting methods 
in order to get a better initialization of the adjoint vector. 
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A Proof that the control constraints are m-simpUces 

We take q' £ K.{q). The problem is to compute the intersection of the affine 
subspace P = {{Xq,u)/u e M"'} in M"+™ and the simplex Aqi (see figure 17): 

[/,.(Xo) = {{Xo,u) I u G R"} n A,, = pn A,, 

We know that the intersection of an affine subspace with a simplex is a simplex 
[47, Theorem 1.1]. Therefore f7g/(Xo) is a simplex in R™. There remain to to 
compute its dimension. The idea is to consider the intersection of the subspace 
P of dimension m with the n-faces of Ag/. These intersections, when they exist, 
are of dimension 0, and define the vertices of f7g'(Xo). We then have to prove 
that there exists exactly (m + 1) different n-faces intersecting P to conclude. 
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Figure 17: Intersection Pn Agr for {n,m) — (1,2) on the left, {n,m) ~ (2,1) 
on the right. 



Let {Xi, . . . , Xn+i} be the vertices of Dg. By construction, we have: (Aq') 



Dg. We can then introduce the vertices of Aqi: 



Vk - 
k = 1 



^3k 



,ik e {l,...,n + l},jfc e {l,...,m+l}, 



,n + m + 1 



and: {Xi^;k = 1, . . . , n + m + 1} {Xi, . . . ,X„+i}. 
We also assume: Xq G Dg, hence: 



n+l 



n+1 



3!(afc)fe=i...„+i e]0, ir+\ ^ «fe = 1 and = ^ akXk 



k=l 



(24) 



I''* step: any n-face of Agi has either one intersection with P, or 
none. 

Let P be a n-facc of Agi defined by the list (renumbered to ease the readability) 
of its vertices: ( )fe=i,...,„+i where u € {1, n+l}, jfc e {1, m+1}. 



• I''* case: the ik are all different. 



We then have: {X,^ / k ^ I, . . . ,n + 1} = {Xi, . . . , X„+i}. (24) thus be 

n+l n+l r V 1 -^0 

comes: Xq = J2 (^ik^ik- We state: Xp = ^ a^^ 



fe=i 



k=l 



X,. 



S ^ik'^ik 

fe=l 



Then: Xp € P- Moreover, by construction of Xp, X^ E Dg involves: 
Xf G F, hence: Xp ePnF. 

By uniqueness of the convex decomposition in a given simplex, we then 
deduce: 

Xf e F and PDF ^ {Xp} 

• 2"'^ case: the ik are not all different. 

We then deduce: {X,^ / fc = 1, . . . , n + 1} C {Xi, . . . , X„+i}. For Y e F, 
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we therefore have: 



n+l 

E 7.. = 1 
3!(7.J;c=i...„+i e [0, <( „+i 

k=l 



X,. 



However, according to the hypothesis: Xq e Dq, Xq depends on all the 

n+l 

Xi^ i = + 1. It follows: ^ -fi^Xi^ ^ Xq, i.e.: Y ^ P, and: 

k=l 

Fnp = 0. 



2""^ step: there exist exactly (m + 1) n-faces intersecting P. 

By construction, we know that: {Xi^^ / k — 1, . . . , n + m + 1} = {^i, . . . , X„+i} 
; after renumbering the vertices of Agi , we can assume: 



yi e {!,..., n + 1}, Vi 



Xi 



• We state: Fq = Conv{{Vi, . . . , Vn+i})- By affine independence of Xi, . . . , Xn+i 
of Dq, the vertices Vi, . . . , Vn+i are also affinely independent, so that Fq 

is actually a n-facc of Ag/ intersecting P. 

• For 7T, + 2<^<7i + m + l (i.e. m possible values for /), we state: 

Fi - Conv {{{Vi, . . . , K+i} - {T^J) U {Vi}) , 
1 < < n + 1 

where: Vi = {Xii^Uji) and 1^, ~ {Xi^^Uj.^). As for Pq, P; is a n-facc of 
Ag' which intersects P. 

To conclude, wc have found (1 + m) n-faces of A^/ which intersect P, i.e. (m-|- 1) 
vertices of Uqi^Xo). 
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